QueryCat: automatic categorization of MEDLINE queries

نویسندگان

  • Wanda Pratt
  • Henry Wasserman
چکیده

A searcher's inability to formulate an appropriate query can result in an overwhelming number of retrieved documents. Our approach to this problem is to use information about common types or categories of queries to (1) reformulate the user's initial query and (2) create an informative organization of the retrieved documents from the reformulated query. To achieve these goals, we first must identify which common categories or types of queries are the best abstraction of the user's specific query. In this paper, we describe a system that performs this first step of categorizing the user's query. Our system uses a two-phased approach: a lexical analysis phase, and a semantic analysis phase. An evaluation of our system demonstrates that its query categorization corresponds reasonably well to the query categorizations by medical librarians and physicians.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Query Translation by Text Categorization

We report on the development of a cross language information retrieval system, which translates user queries by categorizing these queries into terms listed in a controlled vocabulary. Unlike usual automatic text categorization systems, which rely on dataintensive models induced from large training data, our automatic text categorization tool applies data-independent classifiers: a vector-space...

متن کامل

Query and Document Translation by Automatic Text Categorization: A Simple Approach to Establish a Strong Textual Baseline for ImageCLEFmed 2006

In this paper, we report on the fusion of simple retrieval strategies with thesaural resources in order to perform document and query translation for cross–language retrieval in a collection of medical cases. The collection contains textual and visual contents. In this paper, we focus on the textual contents of the collection, which contains documents in three languages: French, English and Ger...

متن کامل

Automatic Text Categorization and Its Applicationto Text

We develop an automatic text categorization approach and investigate its application to text retrieval. The categorization approach is derived from a combination of a learning paradigm known as instancebased learning and an advanced document retrieval technique known as retrieval feedback. We demonstrate the e ectiveness of our categorization approach using two real-world document collections f...

متن کامل

Automatic Text Categorization and Its Application to Text Retrieval

ÐWe develop an automatic text categorization approach and investigate its application to text retrieval. The categorization approach is derived from a combination of a learning paradigm known as instance-based learning and an advanced document retrieval technique known as retrieval feedback. We demonstrate the effectiveness of our categorization approach using two realworld document collections...

متن کامل

Using Discourse Analysis to Improve Text Categorization in MEDLINE

PROBLEM Automatic keyword assignment has been largely studied in medical informatics in the context of the MEDLINE database, both for helping search in MEDLINE and in order to provide an indicative "gist" of the content of an article. Automatic assignment of Medical Subject Headings (MeSH), which is formally an automatic text categorization task, has been proposed using different methods or com...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proceedings. AMIA Symposium

دوره   شماره 

صفحات  -

تاریخ انتشار 2000